Continuous Audio-visual Speech Recognition

Author

Juergen Luettin

Abstract

We address the problem of robust lip tracking, visual speech feature extraction, and sensor integration for audio-visual speech recognition applications. An appearance-based model of the articulators, which represents linguistically important features, is learned from example images and is used to locate, track, and recover visual speech information. We tackle the problem of joint temporal modelling of the acoustic and visual speech signals by applying multi-stream hidden Markov models. This approach allows the use of different temporal topologies and levels of stream integration and hence makes it possible to model temporal dependencies more accurately. The system has been evaluated on a continuously spoken digit recognition task with 37 subjects.
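
The abstract does not say how the two streams are combined inside a state. As a rough illustration only, the sketch below shows one common form of multi-stream integration: each HMM state scores the audio and visual observations separately, and the two log-likelihoods are mixed with a stream weight. The function names, the diagonal Gaussian emission model, and the `audio_weight` parameter are assumptions made for this example and are not taken from the paper.

```python
import math


def diag_gaussian_logpdf(x, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def multistream_log_likelihood(audio_obs, visual_obs, state, audio_weight=0.7):
    """Weighted sum of per-stream log-likelihoods for a single HMM state.

    `state` is a hypothetical dict holding a (mean, var) pair per stream.
    `audio_weight` is an assumed reliability weight in [0, 1]; the visual
    stream receives the remainder.
    """
    if not 0.0 <= audio_weight <= 1.0:
        raise ValueError("audio_weight must lie in [0, 1]")
    log_audio = diag_gaussian_logpdf(audio_obs, *state["audio"])
    log_visual = diag_gaussian_logpdf(visual_obs, *state["visual"])
    return audio_weight * log_audio + (1.0 - audio_weight) * log_visual


# Toy state: 1-D audio features and 2-D visual features, unit variance.
state = {"audio": ([0.0], [1.0]), "visual": ([0.0, 0.0], [1.0, 1.0])}
score = multistream_log_likelihood([0.2], [0.1, -0.3], state, audio_weight=0.7)
```

Lowering `audio_weight` shifts trust toward the visual stream, which is the usual motivation for such a weight when the acoustic signal is noisy. The sketch covers only state-level combination; the different temporal topologies and integration levels mentioned in the abstract are not modelled here.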

Related resources

Audio - Visual Continuous Speech Recogni Markov Mode

With the increase in the computational complexity of recent computers, audio-visual speech recognition (AVSR) became an attractive research topic that can lead to a robust solution for speech recognition in noisy environments. In the audio visual continuous speech recognition system presented in this paper, the audio and visual observation sequences are integrated using a coupled hidden Markov ...

Design and recording of Czech speech corpus for audio-visual continuous speech recognition

In this paper we describe the design, recording, and content of a large audio-visual speech database intended for training and testing of audio-visual continuous speech recognition systems. The UWB05-HSCAVC database contains high resolution video and quality audio data suitable for experiments on audio-visual speech recognition. The corpus consists of nearly 40 hours of audiovisual records of 1...

Characteristics of the Use of Coupled Hidden Markov Models for Audio-Visual Polish Speech Recognition

This paper focuses on combining audio-visual signals for Polish speech recognition in conditions of highly disturbed audio speech signal. Recognition of audio-visual speech was based on combined hidden Markov models (CHMM). Described methods were developed for a single isolated command, nevertheless their effectiveness indicated that they would also work similarly in continuous audio-visual sp...

Speaker independent audio-visual continuous speech recognition

The increase in the number of multimedia applications that require robust speech recognition systems determined a large interest in the study of audio-visual speech recognition (AVSR) systems. The use of visual features in AVSR is justified by both the audio and visual modality of the speech generation and the need for features that are invariant to acoustic noise perturbation. The speaker inde...

Using the Multi-Stream Approach for Continuous Audio-Visual Speech Recognition: Experiments on the M2VTS Database

The Multi-Stream automatic speech recognition approach was investigated in this work as a framework for Audio-Visual data fusion and speech recognition. This method presents many potential advantages for such a task. It particularly allows for synchronous decoding of continuous speech while still allowing for some asynchrony of the visual and acoustic information streams. First the Multi-Stream f...

Publication date: 1998
